Skip to content

AFUS Dog Transformations, Force TSV Column Order, Process AfusOwner Data#227

Open
quazi-h wants to merge 36 commits intomasterfrom
qh-dspdc-1958-afus-dog-merged
Open

AFUS Dog Transformations, Force TSV Column Order, Process AfusOwner Data#227
quazi-h wants to merge 36 commits intomasterfrom
qh-dspdc-1958-afus-dog-merged

Conversation

@quazi-h
Copy link
Copy Markdown
Contributor

@quazi-h quazi-h commented Aug 19, 2022

Why

DSPDC-1958.

We would like to maintain the order of the original data model (schema) for columns in the final TSVs.
We also need to process afus_owner data and generate a TSV.
Extraction and transformation pipeline for the afus_dog tables.

This PR

Combined changes from branches qh-dspdc-1958-afus-dog and qh-force-tsv-column-order.
Added forms needed for afus_dog extraction.
Added schema fragments, pipeline builder, and transformation scripts for afus_dog.
Extends the tsv_convert script to process afus_owner.
Forces the column order to hardcoded lists for tsv column order.

Checklist

  • Documentation has been updated as needed.

quazi-h and others added 30 commits December 6, 2021 18:39
… of columns for a given table.

Refactored the code that maintains the list of columns - switched from a set to a list to maintain order.
Simplified the logic that was determining when to pull out PK columns to the front of a file since we should only do that when running with the Firecloud option.
For default output, the column order in the data models for each table already has the PKs at the front of the table.
…ers, arm generator logic.

Need to update standardDirectives and dateFilters if any are to be implemented (check w/ Matt).
Added unit and integration tests for the new extraction pipeline.
Removing bad test cases (will not hold up over time).
Reducing idBatchSize to 10 to prevent overloading RedCap.
Updated tests, removed debugging changes.
…r table was added as well.

Created a new TransformationHelper function to process AFUS records where data for a single record is spread across multiple arms.
Added tests: pipeline test, unit tests for the owner transformations, and new transformation helper method.
Updated unit tests to make sure we are trimming whitespace.
Added schema, pipeline builder, and transformation scripts for afus_dog.
Added AfusDogDemographics and AfusDogResidence transformation.
…consistent failures.

Extended transformation pipeline tests.
Refactored a common TransformationLog file to encapsulate the errors and warnings for all surveys and updated imports.
…sformations' into qh-dspdc-1958-afus-dog-merged
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants